NEW LOGARITHMIC SOBOLEV INEQUALITIES AND 
AN e- REGULARITY THEOREM FOR THE RICCI FLOW 



HANS-JOACHIM HEIN AND AARON NABER 

Abstract. In this note we prove a new e-regularity theorem for the Ricci flow. Let (M n ,g(t)) with 
t 6 [— T, 0] be a Ricci flow and H XQ the conjugate heat kernel centered at a point (xo,0) in the final 
time slice. Substituting H xa into Perelman's W-functional produces a monotone function W xo (s) of 
s 6 [— T, 0], the pointed entropy, with W xo (s) < 0, and W xo (s) — iff (M,g(t)) is isometric to the 
trivial flow on R™. Our main theorem asserts the following: There exists an e > 0, depending only 
on T and on lower scalar curvature and ^-entropy bounds for (M, g(—T)), such that W xo (s) > — e 
implies |Rm| < r~ 2 on P er (a;o,0), where r 2 = |s| and P p (x,t) = B p (x,t) x (t — p 2 ,i]. 

The main technical challenge of the theorem is to prove an effective Lipschitz bound in x for the 
s-average of W x (s). To accomplish this, we require a new log-Sobolev inequality. It is well known by 
Perelman that the metric measure spaces (M, g(t), dvol 9 ( t )) satisfy a log-Sobolev; however we prove 
that this is also true for the conjugate heat kernel weighted spaces (M , g(t) , H X() (— ,t) dvol g ( t )). Our 
log-Sobolev constants for these weighted spaces are in fact universal and sharp. 

The weighted log-Sobolev has other consequences as well, including an average Gaussian upper 
bound on the conjugate heat kernel that only depends on a two-sided scalar curvature bound. 



1. Introduction 
Throughout this paper we will assume that the pair 

(M n , g (t)), te[-r,o], (i.i) 

is a smooth Ricci flow. For simplicity we assume each time slice is complete of bounded geometry. It 
may also be convenient to assume M is compact when we rely on Perelman's monotonicity formula. 
However, our results will ultimately only depend on entropy and scalar curvature bounds. 
Given a point (xq, 0) € M x [— T, 0] on the final time slice we write 

H X0 (y,s) = H(x Q , | y, s) = (47r|s|)~i exp(-f xo (y,s)) (1.2) 

for the conjugate heat kernel based at (xo,0), and 

duxoiy, s) = H X0 (y, s) dvol g{s) {y) (1.3) 

for the associated probability measures on M; see also Definition! 



Definition 1.4 (Perelman p2]). Let (M n ,g) be a Riemannian manifold. Given an / G C°°(M) and 
r > such that (4itt)~ ^e^^dvol has unit mass, we define associated entropy functionals 

W(gJ,r) = |[T(|V/| 2 + i?) + /-n](4vrT)-f e -^vol, (1.5) 
n(jg,T) = inf|w(g,/,r) : J (4ttt)~^ e~f dvo\ = lj . (1.6) 
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Perelman |18j discovered that W is nondecreasing in t = to — r (to £ if (47tt) _ a e~f evolves by 
the conjugate heat equation coupled to the Ricci flow in the time variable t < to; see Theorem 12.141 
As a consequence, the quantity fj,(g(t),to — t) is nondecreasing in t < to along any Ricci flow. 

1.1. Pointed entropy and e-regularity. Our primary concern in this paper will be a localized 
version of the W-entropy. Namely, given a Ricci flow as in ([Lip , there exists for each point (xo,0) 
in the 0-time slice and s £ [— T, 0) the canonical metric probability space (M n , g(s), dv xo (s)), where 
dv XQ {s) = H XQ (— , s) dvol g ( s -) is the conjugate heat kernel measure as in f)l .3|) . 

Definition 1.7. The pointed entropy at scale y/\s\ based at xq is defined by 

W X0 (s)=W(g(s),f X0 (s),\s\). (1.8) 

It is a consequence of Perelman's gradient formula, see Proposition 12.231 that 

linings) = 0, -^W XQ (s)>0. (1.9) 
s— >o as 

Moreover, the pointed entropy at a given point and scale vanishes if and only if the flow is isometric 
to the trivial flow on Euclidean space. One can view this as an example of a rigidity theorem. 

What is often useful in such situations is an "almost rigidity" statement: If the pointed entropy 
at a single point xq is close to 0, then the Ricci flow ought to be smoothly close to flat M. n near xq. 
There are many examples of such statements in the literature. For instance, in the case of Einstein 
manifolds one has an e-regularity theorem based on the volume ratio V x (r) = log(Vol(-B r (x))/u; n r n ) 
going back to Anderson [T], which is at the backbone of the regularity theory in [6]. 

To make all of this precise we introduce the regularity scale of a point. Given (x, t) € M x [— T, 0] 
a bound on the curvature |Rm|(x,t) at this point gives remarkably little information. On the other 
hand, a bound on the curvature in a spacetime neighborhood of (x, t) tells us everything about the 
local geometry of the Ricci flow. Thus we make the following definition: 

Definition 1.10. Given a Ricci flow (M n ,g(t)) as in (jl.ip we define the following. 

(1) Given (x, t) G M x [— T, 0] and r > with — T < t — r 2 we define the parabolic ball 

P r (x,t) = B r (x,t) x (t-r 2 ,t\. (1.11) 

(2) Given (x, t) S M x [— T, 0] we define the regularity scale 

r|R m |(rr, t) = sup{r > : sup |Rm| < r~ 2 }. (1-12) 

Pr{x,t) 

Remark 1.13. The regularity scale is defined to be scale invariant: If r|R m |(x, 0) = r and if we rescale 
the Ricci flow by r , so that P r (x, 0) is mapped to P\(x,0), then we have |Rm| < 1 on Pi(x,0). 

Remark 1.14. The regularity scale controls not only the curvature but also its derivatives. Namely, 
by standard parabolic estimates there exists for each k € N a dimensional constant C(n,k) such 
that if r|Fj, m |(x, t) = r then we have the estimates 

sup |V fc Rm| < C{n,k)r- 2 - k . (1.15) 

Pr(T,t) 

Our main theorem then takes the following form: There exists an e > such that W XQ (s) > — e 
implies ^|R m |(xo,0) 2 > e\s\. Of course the use of such an estimate is only as good as what e depends 
on, and we are able to bound e below purely in terms of T and lower scalar curvature and //-entropy 
bounds at the initial time — T, which is really the most one might hope for. 
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Theorem 1.16. For each C > there exists an e = e(n,C) > such that the following holds. Let 
(M n ,g(t)) be any Ricci flow as in (jl.lh such that 

R\g(s)]>~, inf u( 9 (s),t) > -C, (1.17) 

\S\ tS(0,2|s|) 

for some s G [-T, 0). // i/ie pointed entropy satisfies 

Wx (s) > -e (1.18) 

/or some poini xq in the 0-time slice, then we have 

r lRm \(x ,0) 2 >e\s\. (1.19) 

Remark 1.20. One typically takes the point of view that (M, g(—T)) is a fixed Riemannian manifold 
but that the runtime T of the flow is variable. One would then like to derive (|1.17p from information 
about the initial time slice (M, g(—T)). The appropriate conditions to impose are 

R[g(-T)} > ~, inf u(g(-T),T) > -C, (1.21) 

\s\ re(T-\s\,T+\s\) 

because both inf R[g(t)] and fx{g{t),tQ — t) for any fixed £g > t are nondecreasing in t. Even if one 
then only cares about fixed values of s, (jl.21f) still exhibits an explicit T-dependence in the entropy 
condition which may degenerate as T — > oo. This issue is familiar from 



The known e-regularity theorems for the Ricci flow either allow e to depend on a type I sectional 
curvature bound jS], or require a smallness condition such as (|1.18p to hold at every x in a definite 
neighborhood of xq [IT], which clearly follows from ()1.18|) under a type I condition. 

In order to understand the technical difficulties encountered in the proof of Theorem 11.161 let us 
give a brief outline of the argument. We would like to follow Anderson's proof [Tj of his e-regularity 
theorem for Einstein manifolds as much as possible. Namely, assume the theorem fails. Then for all 
e > we can find Ricci flows (M e ,g £ (t)) and points (x e ,0) such that ()1.17|) and (|1.18p are satisfied 
but (|1.19p fails. We can assume s e = 1 by rescaling. After a careful point picking we would then like 
to say that x e more or less minimizes the function r| Rm |(x,0) in a small but definite ball while still 
satisfying W Xe (l) > — 2s. Then, after rescaling so that r\n m \(x e ,0) = 1, we can take a subsequence 
converging smoothly to a complete pointed Ricci flow (Moo, 9oo(t), Xoo) such that riRmi (xoo, 0) = 1, 
yet Wj 0o (s) = for all s and hence {M^^g^t)) = R n , which is the desired contradiction. 

However, in order for this to go through we require W x (s) to depend on x in a Lipschitz manner 
(for the volume ratio V x (r) in the Einstein case this is clear by volume comparison). We are in fact 
unable to prove this assuming only (I1.17p . What does turn out to be possible under (I1.17p . though, 
is to obtain Lipschitz control on the s-average ofW s (x), which is still enough for our purposes. To 
accomplish this we introduce new log-Sobolev estimates for heat kernel measures. In addition, this 
s-average turns out to be an interesting monotone quantity in its own right; we will put this to some 
use in obtaining new integral bounds for the conjugate heat kernel, see Theorem 14.51 



1.2. Log-Sobolev inequalities for the conjugate heat kernel measure. Consider a smooth 
metric probability space (M,g,dv), where dv = e~fdvol g . If the Bakry-Emery condition 

Ric + V 2 />^5 (1-22) 
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is satisfied, then a celebrated classical theorem from [2] asserts that (M, g, dv) satisfies a log-Sobolev 
inequality. That is, for every smooth function with compact support on M we have 

J(j> 2 dv = l J (f> 2 log (f) 2 dv < 4 J \V<P\ 2 dv. (1.23) 

The case of flat R n equipped with the Gaussian measure dv = (4n)~2 exp(—j\x\ 2 )dx is already very 
interesting; this case is equivalent to the original log-Sobolev inequality proved by Gross [10] . 

The log-Sobolev (|1.23|) can be used to prove that the operator Af(ft = Acf) — (V/, V</>), which is 
(i^-selfadjoint, has discrete spectrum. Linearizing (jl.23p around (ft = 1 yields a Poincare for dv that 
tells us that the smallest positive eigenvalue of A f is at least and it follows from [2] that equality 
holds if and only if M splits off a line, in which case the ^-eigenfunction is linear. 

Remark 1.24. If M is compact, an improvement depending on diamM for the spectral gap ^ was 
recently proved in [9]. Intriguingly, on every nontrivial normalized gradient shrinking Ricci soliton, 
Ric + V 2 / = ^g, the soliton function / itself is always a 1-eigenfunction of A^. 

In the context of a Ricci flow (M n ,g(t)) with its conjugate heat kernel measures (jl.3p . Perelman's 
monotonicity formula (|2.16p would suggest that the metric probability spaces (M, g(s),dv xo (s)) are 
typically well approximated by shrinking solitons. Thus the Bakry-Emery inequality (11.23H may at 
least help to motivate (if not prove) the following result, which is our main technical tool. 

Theorem 1.25. Let (M n ,g(t)) be a Ricci flow as in ([L~T]) . Fix x G M and s G [-T,0). 
(1) For all <j) G C£°(M) with J (j)dv XQ (s) = 0, 



<p 2 dv X0 (s) < 2\s\ J \VcP\ 2 g{s) dv X0 ( S ). (1.26) 

Equality holds if and only if either (p = 0, or (M,g(t)) = (M',g'(t)) x (M, dz 2 ) isometrically for all 
t € [s, 0] with z{xq) = and (j) = Xz for some constant A G R*. 
(2) For all <j> G C^°(M) with f (j) 2 dv Xo {s) = 1, 

h 2 log 4> 2 dv xo (s) <A\s\f \V(f>\ 2 g{s) dv X0 (s). (1.27) 

Equality holds if and only if either <f> = 1, or (M,g(t)) = (M',g'(t)) x (R, dz 2 ) isometrically for all 
t G [s, 0] with z(xq) = and <p = exp(Az — 2A 2 |s|) for some constant A G R*. 

Remark 1.28. Perelman's monotonicity formula implies an unweighted log-Sobolev inequality (|2.19p 
whose optimal constant depends on T and on various bounds for the geometry of (M, g(—T)). The 
weighted inequalities in Theorem 11.251 on the other hand are sharp and completely universal. As far 
as we can tell, however, the applications of (|2.19p and (jl.27p are essentially disjoint. 

Remark 1.29. A static version of Theorem II .251 for the heat kernel measure on complete Riemannian 
manifolds with Ric > was proved independently by Bakry-Ledoux [3] and (with a spurious extra 
factor of n) Bueler [3]. These papers inspired our proof of Theorem 11.251 

1.3. Integral and pointwise bounds for the conjugate heat kernel. Before returning to our 
discussion of the e-regularity result, Theorem 1 1.1 6| we wish to explain an interesting consequence of 
Theorem 11.251 The basic idea is to test (jl.27p with functions (ft that are well adapted to the metric 
geometry; this is the so-called "Herbst argument" in metric measure theory [14] . 
The following appears to be the sharpest possible result such methods can yield. 
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Theorem 1.30. Let (M n ,g(t)) be a Ricci flow as in (jl.ip and let dv = dv XQ {s) be a conjugate heat 
kernel measure as in (|1.3j) . Then the Gaussian concentration inequality 

u(A)v{B) < exp f __Ldist flW (4 Bf^j (1.31) 

holds for all A,B C M . Here dist refers to the usual set distance, not the Hausdorff distance. 

Let us observe the following natural consequence. Given x\,X2 6 M in the s-time slice, we can 
apply (jl.3ip to dv = dv X2 , choosing A, B to be the metric balls B r {x\, s), B r (x2, s) with r 2 = \s\. If 
B r (xi,s) is in addition noncollapsed with a uniform constant, which is of course a fair assumption 
after Perelman [18] , then we obtain the following average Gaussian upper bound: 



The primary concern with this estimate is a lack of effective lower bound on v X2 (B r (x2, s)). We 
can fix this to some extent by bringing in a pointwise Gaussian lower bound for H X2 from [21], see 
Theorem 12.371 however this forces us to work with time-0 balls, not time-s balls. 

Corollary 1.33. For each C > there exists a C = C'(n, C) > such that the following holds. Let 
(M n ,g(t)) be any Ricci flow as in (jl.ip such that, for some s S [— T, 0), 

sup ||%(t)]||oo < n> , inf , „Ms),t) > -C (1.34) 

te[s,0] \ s \ re(0,2\s\) 

Let x\,X2 S M and put r 2 = \s\. Then we have an average Gaussian upper bound 

£ H X2 (s) dvol g{s) < C'\s\-%exp ^--^dist 3(s) (B r (xi,0),B r (x 2 ,0)) 2 J . (1.35) 

J _B r (xi,0) V 1^1 / 

As a consequence, we get the following distance distortion type estimate: 

dist g ^(B r (x 1 ,0),B r (x 2 ,0)) < C r/ d fl ( )(x 1 ,x 2 ). (1.36) 

Remark 1.37. Both (jl.32p and fj 1 .35 j) fall short of what one might hope to have. For example, there 
is a method of proving pointwise Gaussian upper bounds for heat kernels on static manifolds relying 
on nothing more than a log-Sobolev [7]. Unfortunately this approach seems to break down for the 
Ricci flow for lack of control on the distance distortion between different time slices. 

1.4. Lipschitz continuity of the pointed Nash entropy. We now return to the main flow of 
the argument and explain how the Poincare inequality of Theorem ll.25l fl) helps us to complete the 
proof of Theorem 11.161 As we said at the end of Section 11.11 what we require is a Lipschitz bound 
in x for W x (s), or at least for a weaker quantity than W x (s) that still controls the soliton behavior 
of our Ricci flow near x. It turns out that we can work with the time average of W x -(s): 

Definition 1.38. Given xq € M and s € [— T, 0), we define the pointed Nash entropy by 

AU*) = A [ W*o(r)dr= [ f xo (s)dv xo (s)-^. (1.39) 
l s l Js Jm 1 

See Proposition 12.261 for the equality in (|1.39p and other basic properties. Thus, M XQ (s) is closely 
related to the quantity used by Nash [TB] in proving Holder continuity of weak solutions. 
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Theorem 1.40. For each C > there exists a C = C'(n, C) > such that the following holds. Let 
(M n ,g(t)) be any Ricci flow as in (11.11) such that 

R[g(s)} > ~ inf a(g(s),r) > -C, (1.41) 

\S\ tS(0,2|s|) 

for some s G [— T, 0). Then the mapping 

(M,g(0)) 3i4 f x (s)H x (s) G L 1 {M , dvo\ g{s) ) (1.42) 
is globally C'\s\~z -Lipschitz. In particular this implies that 

\M Xl (s) - M X2 (s)\ < C'\s\-h m (x 1 ,x 2 ). (1-43) 

The idea here, and the reason for introducing N x in the first place, is that M x , unlike W x , does 
not depend on derivatives of H. Thus, V x Af x can be bounded by the gradient estimates of \19\ I20j. 
The Poincare (jl.26p comes in as a crucial tool to control the L 2 -norm of f x (s) in the process. 

Finally, with Theorem 11.401 in hand it is not difficult to complete the proof of Theorem 11.161 (or in 
fact of a slightly strengthened version in which W Xo gets replaced by M Xo ) because the contradiction 
argument outlined in Section 11.11 works the same way with M xo in place of W xo . 

2. Background material 
2.1. The heat operator and its conjugate. Let (M n ,g(t)) be a Ricci flow as in (jl.ip . 

Definition 2.1. The heat operator and its conjugate along the flow are defined by 

□ = d t - A, (2.2) 
□* = —d t — A + R. (2.3) 

This is a sensible definition because of the following identity. 

Lemma 2.4. Let Q, be a smooth bounded domain in M and [t\,t<2\ Q [— T, 0]. Then 



2 [ (Du)v - (D*v)u 
for all smooth functions u,v : M x [ii,t2] - 





*2 rt-2 r i 


/ uv 




In 





du dv 
dn dn 



Definition 2.6. For x,y G M and s < t in [— T, 0], we let H(x,t\y,s) denote the conjugate heat 
kernel based at (x,t), i.e. the unique minimal positive solution to the equations 

D* y)S H(x,t\y,s) = (-d s - Ay )g{s) + R(y,s))H(x,t\y,s) = 0, (2.7) 

UmH(x,t\y,s) = 5 x {y). (2.8) 

If we wish to fix a point x G M we may write H x (y, s) = H(x, | y, s) = (47r|s|) _ 2 e -fx(u> s ) . 

Lemma 2.9. The conjugate heat kernel satisfies the following properties. 
(!) J H(x,t\y,s)dvol g{s) (y) = 1. 

(2) H (x, 1 1 y, s) is also the fundamental solution of \H x j = dt — A x ^ g ^ with pole at (y, s). 

(3) f H(x,t\y,s)dvol g( f ) (x) < exp(p(t - s)), where p = \\R[g(-T)]- \\oo. 
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Proof. (1) is simply the mass conserving property of the conjugate heat equation. 

(2) requires some more care, and indeed reflects a very general fact concerning the fundamental 
solutions of parabolic operators and their formal adjoints. Let us write 

u = H a (-,-\y,s), (2.10) 
v = H D *(x,t\-,-), (2.11) 

for the fundamental solutions of □ and □* with poles at (y, s) and (x, t) respectively. We then apply 
(12. 5|) to u and v on the following three domains in the limit as s — > 0: 

[s, s + e 2 } x (M \ B £ (y, s)), [s + e 2 ,t — e 2 ] x M, [t — e 2 ,t] x (M \ B £ (x, t)). 

Using standard local asymptotics for u and v at their poles, this yields u(x,t) = v(y,s). 

(3) follows by differentiating f H(x, t \ y, s) dvol g ^(x) by i, substituting O x>t H = from (2), and 
using that the minimum of the scalar curvature is nondecreasing along the Ricci flow. □ 

Finally, we recall a key tool that has already been put to good use in [15|, [T9] . 

Lemma 2.12. The following parabolic Bochner formula holds for all spacetime functions u: 

\J^\Vu\ 2 = -|V 2 n| 2 + (Vtt.VDit). (2.13) 

Proof. Using -§[\Vu\ 2 = 2(V^u,Vu) + 2Ric(Vu, Vu), this reduces to the usual computation. The 
Ricci term here cancels with the Ricci term from the standard elliptic Bochner formula. □ 

2.2. Properties of the entropy functionals. We reviewed the definitions of Perelman's entropy 
functionals W(<?, /, t) and fi(g,r) in Section [TJ In addition we introduced localized versions W XQ (s) 
and J\f Xo (s). In this section we collect basic properties and applications of these functionals. 
We begin with Perelman's foundational monotonicity formula and two corollaries \13\ I18j. 

Theorem 2.14. Fix a smooth probability measure dv on M and let f{t), g(t) be families of functions 
and metrics on M parametrized by t S [-T, 0]. Fix any to £ K and put r(t) = to — t. If 

^ = -2(Ric + V 2 /), (4ttt)~£ e-^vol = dv, (2.15) 
for all t < to, then we have 

-W(g,f,r) = 2r J Ri c + V 2 f-^ dv. (2.16) 

Up to correcting by the diffeomorphisms generated by V/, (|2.15j) is equivalent to g evolving by 
Ricci flow and (Attt)~ 2e~f evolving by the conjugate heat equation associated with this Ricci flow. 
For the rest of the present section, (M n ,g(t)) will denote a Ricci flow as in (jl.ip . 

Corollary 2.17. For all to £ the quantity fj,(g(t),to — t) is nondecreasing int < to. The quantity 
is constant if and only if the flow is isometric to a gradient shrinking soliton with singular time to 
and soliton function f(t), where f{t) denotes any minimizer in the definition of fi(g(t),to — t). 

Corollary 2.18. For a fixed to £ M, define /io = fJ>(g(—T),to + T) and write t = to — t. Then 

J 4> 2 log cf) 2 dvol <t J (4|V0| 2 + R<p 2 ) dvol - | log4vrr - n - no (2.19) 
holds for all <f> G C^°(M) with f (j) 2 dvol = 1 as long as t < t . 
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We will then also need to recall how (|2.19p implies Perelman's no local collapsing theorem |18j . 
We state an improved version of this result that only requires an upper scalar curvature bound |13j . 
As reported in [13], this is due to Perelman as well. The way we organize the proof may be slightly 
simpler than the version in |13| and was inspired by an argument in [5]; see also |20] , 

Theorem 2.20. Fix any t G [— T, 0], x G M, and r > 0, and suppose that we have 

inf fi(g(-T),t + T + p 2 ) > -C, sup R[g(t)} < Cr~ 2 . (2.21) 
pe(0,r) B r {x,t) 

Defining k = exp(— (2 n+4 + 2C)) it then follows that 

\B r (x,t)\> Kr n . (2.22) 

Proof. We work in the i-time slice. For p G (0, r] define a Lipschitz function ip to be = 1 on B(x, |), 
= off B(x,p), and linear in d(x, — ) in between, and apply (I2.19P to 4> = VVHV'lh- Using Jensen's 
inequality with respect to r^^ji on B(x,p) to bound the left-hand side from below, we obtain 

1 ^ 16r f\B(x,p)\ \ Ct , , 

This holds for any given r > 0, with po depending on r by definition. We now make r = p 2 . Using 
our definition of k, it is then easy to prove the following implication: 



*5 



(|)" => |B(x,p)|> K p». 



Since k is smaller than the volume of 5^(1), the claim follows from this by iteration. □ 

Finally, we summarize some basic properties of the localized entropies W x (s) an d N x (s) that we 
introduced in Definitions 11.71 and 11.381 The first couple of facts are clear from Theorem 12.141 

Proposition 2.23. The following hold for all x G M and all s G [— T, 0). 

(1) lim^oHUs) = 0. 

(2) p(g(—T),T) < W x (s) < 0. 

(3) W x (s) = - /° 2|r| / |Ric + V 2 /x - 2^| 2 c&/*(r) dr. 

The following straightforward computation explains the use of the Nash entropy. 



Lemma 2.24. Let u be a smooth positive solution to the conjugate heat equation, of rapid decay and 
of unit mass. Fix to G R, put r = to — t for t < to, and write u = (4ttt)~ 2 e~f . Then 

y u log u dvol^ = W(g, f, r) + n + ^ log 4vrr. (2.25) 

Let us then summarize what we learn from this together with Proposition 12.231 

Proposition 2.26. The following hold for all x G M and s G [— T, 0). 

(1) W x (s) < M x (s) < 0. 

(2) *-M x (s) = ±(N x ( S )-W x ( S ))>0. 

(3) N x (s) = - /log H x (s)dv x {s) - § (l + log47r|s|) = / f x (s) dv x (s) - §. 

(4) N x {s) = - /° 2|r|(l - §) / |Ric + V 2 /x - di^r) dr. 
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2.3. Heat kernel estimates. Here we provide careful statements and applications of some useful 
estimates due to Zhang [19j [20l [21] . As usual, we let (M n ,g(t)) denote a Ricci flow as in (11.11) . 

In [20], Davies's method [7j for deriving L°° heat kernel estimates from log-Sobolev inequalities 
is applied to the Ricci flow, based on Corollary 12.181 During the proof, the scalar curvature term in 
(I2.19P gets compensated by the evolution of the Riemannian measure, so that the final result holds 
without any upper scalar curvature assumptions. 

Theorem 2.27. Define two auxiliary quantities 

p^WRM-T)]-]^, p = inf p(g(—T), r). (2.28) 

rG(0,2T) 

Let u : M x [11,12] — > 1R + with [£1, ^2] ^ [ — T,0] be a smooth positive solution to 

For each t G [11,12] w a then have 

IKi)||oc < (4vr(t - h))-%e* t - tl '>-' t \\u(t 1 )\\ 1 . (2.30) 

Hamilton [11] proved a Harnack inequality for positive solutions to the heat equation on manifolds 
with lower Ricci bounds that only involves the gradient in the space variables. In [19] this idea was 
applied to the Ricci flow, using the Bochner formula of Lemma 12.121 as the key tool. 



Theorem 2.31. Let u: M x [ii,t2] — > ^ + with [£1 , £2] Q [— T, 0] be a smooth positive solution to 

du 
di 



du , 
7i7 = \tt)u. (2.32) 



Then we have a spatial Harnack estimate 



vwiog supu 



< ; 1 . (2.33) 



u 

The following corollary was not stated in [19] but will be useful for us in Section HJ 
Corollary 2.34. For each C > there exists a C = C'(n, C) > such that if we have 

R[g{s)\ > ~, inf u(g(s),T) > -C, (2.35) 

|S| rS(0,2|s|) 

and if we write H(x, 1 y, s) = (4ir\s\)~ 2 exp(— f x (y, s)) as before, then 

\V x f x \ 2 < S(C + f x ). (2.36) 
\s\ 

Proof. Fix y, s and let u(x, t) = H(x, t\y,s), so that u solves the heat equation by Lemma 12.91 We 
can apply Theorem 12.271 with T = \s\ and [ti, t?\ = [s + e, 0] for e — > to conclude that 

u = sup u < C \s\ 2 . 

[§,0]xM 

On the other hand, Theorem E3U with [t u t 2 ] = [|,0] yields |V X ./ X | 2 < log f at (x,0). □ 

Finally, in [21], Perelman's Harnack inequality from [T5] is used to prove a Gaussian lower bound 
for H in terms of distance in the final time slice, by bringing in Theorems 12.311 and 12.271 
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Theorem 2.37. Define p, fj, as in (|2.28p and write r = t — s for s < t in [— T, 0]. Then 

H(x, t\y,s) > (8vrr)^t e xp (~d g (t) (x, yf - J y/aR{y, t - a) da - pr + fij . (2.38) 
Notice that (|2.38|) involves the g(i)-distance because Theorem 12.311 bounds the x-gradient. 

3. LOG-SOBOLEV AND GAUSSIAN CONCENTRATION 

Section [3.11 proves Theorem 11.251 using methods in the spirit of the papers OH], which deal with 
the analogous problem for static Riemannian manifolds with Ric > 0. In fact, this proof is not far 
removed from the usual proof of a log-Sobolev under the Bakry-Emery condition ()1.22j) . Section 13.21 
deduces the Gaussian concentration (Theorem ll.3U|) , using a standard argument from the theory of 
log-Sobolev inequalities. Corollary 11.331 is then deduced as a consequence. 

3.1. Proof of the Poincare and log-Sobolev inequalities. The starting point is to rewrite our 
two inequalities in a more convenient way. Writing dv = dv XQ (s) and passing to square roots in the 
log-Sobolev, we need to prove that for all u E Cq°(M), with u > in the second case, 

J u 2 dv - udv^j <2\s\ J \Vu\ 2 dv, (3.1) 
u log udv — ( [ udv J log ( f udv J < \s\ f — dv. (3-2) 



u 

Note that (|3.2p in fact implies (|3.ip by linearizing around u = 1, but information about the equality 
case is lost in this way. However, we will prove (|3.ip and (|3.2p completely in parallel, with almost 
identical discussions of the respective equality cases. 

The key insight, not unlike [2], is that the heat kernel provides a homotopy between the two terms 
that are being subtracted on the left-hand sides of (|3. 1 1) . ()3.2p . The proof then reduces to deriving 
a gradient estimate for the forward heat equation via the Bochner formula (Lemma 12. 12|) . 

In [4], this is done (in the static case) by bounding the Hodge heat kernel on 1-forms in terms of 
the heat kernel on scalars using a Kato type inequality; unfortunately, see |12| . this method loses a 
factor of n = rank T*M, which is not accounted for in [4]. While the same idea works for the Ricci 
flow, we will therefore instead follow in spirit the approach of [3] , where a precise gradient estimate 
is obtained (in the static case) by applying the heat kernel homotopy principle once again. 

For s < t in [— T, 0], we write P s tu for the evolution of u G Cq°(M) from time s to time t under 
the forward heat equation coupled to the Ricci flow. In other words, by Lemma 12.91 



{P st u)(x) = J u(y)H(x,t\y,s)dvol g ( s )(y). (3.3) 
Given this, the following lemma records the key homotopy principle. 
Lemma 3.4. (1) For any family of smooth functions Ut parametrized by t G [— T, 0], 

^-PtoUt = Ptoa t U t . (3.5) 
at 

(2) Let u G Cq°(M) and put ut = P s tu, so that OtUt = 0. Fix <j), ip : M — > R. Then we have 

U t = cj){u t )^UU = -<f>"(u)\Vu\ 2 , (3.6) 

U t = ijj(u t )\Vut\ 2 g{t) ^DU= -2^(u)\V 2 u\ 2 - 4iP'(u){V 2 u, du ® du) - if)"(u)\du ® du\ 2 , (3.7) 
omitting all subscripts t and g(t) on the right-hand side. 
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Proof. (1) The representation formula (|3.3p yields 

d d f 

—P m U t {xo) = — / U t (x)H(x ,0\x,t)dvol g( t)(x). 

Now use that H{xq, | x, t) solves the conjugate heat equation in (x, t) and Jjdvol = — R dvol. 
(2) The first claim is straightforward. The key to the second claim is Lemma 12.121 □ 

Combining (|3,5p . (|3.6p . we can now rewrite the left-hand sides of (|3,ip . (|3.2p as follows: 



n 



d 



(u)du - <f) ( / udv ) = - / —P t0 (^(P st u))(x )dt 

P to (ct>"(P s tu)\VP s tu\ 2 g{t) )(xo) dt, (3.8) 

j s 

with (p(x) = x 2 and cft(x) = xlogx, respectively. We next estimate the integrand by combining (|3.5p . 
(j3.7|) with ^ = <^>", replacing t, by r, £ for a new variable r G [s, t]. Thus, in the a; 2 case, 

|VP st u|* (t) = P st (\Vu\ 2 g{s) ) - 2 T P rt | V 2 P sr u\ 2 g(r) dr. (3.9) 

Substituting this into (j3.8|) immediately yields the Poincare inequality of Theorem 11.25( 1). In the 
x log x case, (|3.7p simplifies quite drastically and we obtain 



\^ p sM 2 g{t) _ (Wu\ 2 n 



PstU 



Pst ( U U 9[S) j - 2 J* P rt ((P sr u)\V 2 logP sr u\ 2 gir) 



dr. (3.10) 



Again we substitute this into (|3.8p to prove the required log-Sobolev inequality. We can then finish 
the proof of Theorem 11.251 by observing that equality can occur if and only if either u = const, or 
the flow lines of Vu or V log u split off as isometric M-factors. 

3.2. Gaussian concentration. Given the inequalities in Theorem 11.251 proving the concentration 
estimate claimed in Theorem 11.301 is by now a standard exercise in abstract metric measure theory: 
log-Sobolev inequalities yield Gaussian concentration (Herbst), and Poincare inequalities, which are 
weaker, still yield exponential concentration (Gromov-Milman). We refer to Ledoux [JJ] for a good 
exposition; the following proof merely recalls the relevant points from |14j . 



Proof of Theorem \1.30L We proceed from the log-Sobolev in the version (|1.27p . The first idea is that 
this allows us to bound, in a specific fashion, the Laplace transform of 1-Lipschitz functions with 
zero average. The second idea is to apply this bound to the distance function from a set. 
As for the Laplace transform bound, fix F S C°°(M) with 



Fdu = 0, |VF| < 1. 
Let us define a Laplace type transform, or moment generating function, by 



U{\) = j log J e XF dv. 



This is 0(A) as A — > 0. Moreover, (jl.27p applied to (j) 2 = e XF / f e XF dv easily yields 

dU . 

for all A > 0. Thus, altogether, 

" z XF dv < e^ 2 . 



I' 
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We now apply the preceding inequality to the two test functions 

F = ±{G - j Gdv), G(y) = dist(y,B). 
Then we immediately obtain that 

e M*(A,B) u ( A )v(B) < [ [ e x ^- F ^dv{ yi )du{y 2 )<e 2 \ s \ x \ 

J A J B 

and the theorem follows from this by optimizing in A. □ 

Proof of Corollary 1 1.331 We begin by applying Theorem 11.301 with xq replaced by x 2 and 

A = B r (xi,0), B = B r (x 2 ,0). 

In order to derive (|1.35p we must bound the factor v(B) on the left-hand side of (jl.3ip from below. 
This can be done using two ingredients. First, using Zhang's Theorem 12.371 



Second, the evolution of the volume form under Ricci flow and Theorem 12.201 tell us that 

Vol g{s) (B r (x 2 ,0)) > ^Vol g(0) (£ r (x2,0)) > ^r n . 

Together these show that v{B) > ^7. Then (jl.35|) follows by dividing through by \ol g ^(B r (xi, 0)), 
which we can bound from below by the same argument we just used for Vol g ^(B r (x 2 , 0)). 
Finally, Theorem 12.371 also tells us that 

1 n ( C' \ 

inf H X2 (s) > — |s|~2 exp -— d g{0) {xi,x 2 ) 2 , 

B r (xi,0) U \ |S| J 

which allows us to deduce (jl.36p from (jl.35p . □ 

4. LlPSCHITZ CONTINUITY OF THE POINTED NASH ENTROPY 

We now prove Theorem 11.401 as a consequence of Corollary 12.341 and our weighted Poincare (jl.26[) . 
The standing assumption throughout this section is that (M n ,g(t)) is a Ricci flow parametrized by 
t G [— T, 0] satisfying (|1.4ip . In order to prove that the mapping x f x (s)H x (s) from (M, <?(0)) to 
L 1 (M, <ivol 9 ( s )) is Lipschitz, clearly all we need to do is estimate the integral 



1^ J \V x (f x (y,s)H x (y, S ))\dvo\ g{s) (y). (4.1) 

Inserting the expression for H x in terms of f x , and writing dv = du x (s), we find that 

X = J \V x f x ~ f x V x f x \ dv < \\V x f x \\ 2 (l + ||/ x || 2 ), (4.2) 

where the subscript 2 indicates the L 2 -norm with respect to the probability measure v. Now using 
Corollary 12.341 we have the pointwise estimate 



IV^I 2 < ^(C + f x ). (4.3) 
\s\ 

If we substitute this into (|4.2p we therefore get 

X<C"|s|~3 ^1 + y \f x \ 2 diA . (4.4) 
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To deal with this term we need the Poincare inequality (jl.26p and some simple observations based 
on monotonicity of the Nash entropy. Precisely, the following lets us complete the proof. 

Theorem 4.5. Under (|1.4ip the following hold. 

(1) ff*dve[%-C,2]. 

(2) f\Vf x \*d»<(% + C)± 

(3) f\f x \ 2 dv<2(n + 2C) 2 . 

Notice carefully that V/ x means V y f x here, not V x f x as in (|4.2I) . 

Proof of Theorem \4-5\ The first statement follows from Propositions 12.231 and [27261 For the second 
statement, notice that the inequality W x (s) < Af x (s) is equivalent to 

J (\^f x \ 2 + R)dv<^. 

Finally, the Poincare inequality of Theorem 11.251 gives us 

f |/x| 2 ^<2| S | j \Vf x \ 2 du+^J f x dv} , (4.6) 

and the two terms on the right-hand side are bounded by the first two statements. □ 

5. Proof of the e-regularity theorem 

Throughout this section we are considering a Ricci flow (M n ,g(t)) as in (jl.ip that satisfies (|1.17p . 
We begin by proving Proposition 15. 2| which is a more restrictive version of Theorem II. 161 and then 
use the continuity statement of Theorem 11.401 to complete the proof of Theorem 11.161 in full. 

For a point y G M on the 0-time slice we define the normalized time-scale 

t(y) = -min{T,r| Rm |(y,0) 2 }. (5.1) 
Proposition 5.2. There exists an e = e(n, C) > such that if we have 

Vy € B s (x,0) : N t(y) (y) > -e (5.3) 

for some x G M and < 5 < \[T , then we also have 

Vy e B s (x,Q) : r| Rm |(y,0) > e ■ d g{0) (y, dB s (x, 0)). (5.4) 

Proof. (1) By rescaling there is no harm in assuming 6 = 1 < T. Let us assume for some n, C > 
that the lemma fails. In this case we have for all i E N that there exists a complete Ricci flow 

{M?, gi {t),( Xi ,0)) 

with t £ [—1,0] and x% € Mj such that for each y G B\ {x% , 0) we have 

M t[y) (y)>-- V (5.5) 

whereas any point yi G -Bi(xj,0) minimizing the quantity 

f v _ r lRm{ (y,0) 

«fli(o)C2/,oBi(xi,0)J 



must necessarily satisfy 



< w{ yi ) < i (5.7) 
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(2) Choose any such yi, define = ^|R m |(yi, 0), and consider the rescaled Ricci flows 
(Mi,gi(t),{yi,0)), gi(t) = ^ gi (rft), te[-\,0}. 

Certainly r| Rm |(yj, 0) = 1, and y~i moves away from the boundary in that 

1 i 

^dg m (yi,dB^(xi,0)) > - 

On the other hand, since yi minimizes w, we find that 

1 

2 

Moreover, by Perelman's no local collapsing, see Theorem 12.201 we have that 



d i = o%(o)fe. 35 lfe.°)) ^ o- ( 5 - 8 ) 



V G B d .(yi,0) =^ r ]Rml (y,0) > -. (5.9) 



Vol s . (0) (Bi(y,0))>K(n). (5.10) 

Thus we have uniform smooth bounds on Pi/i(y, 0) for all y G 5^(^,0), and uniform noncollapsing 
at scale 1 on 5^(^,0). This allows us to pass to a subsequence to derive a pointed C°° limit 

(M i5 &(*),(&,())) -> (Moo,5oo(t),(yoo,0)). 

The limit exists for t G [— yj^O] and is complete with bounded curvature while satisfying 

r|Rm|(yoo,0) = l. (5.11) 

(3) To contradict this with ()5.5[) we need a few estimates for the heat kernel. Namely, 

(4^|t|)-texp(-ZL i J(y)) <H(yi,0\y,t) <C'(n,C)\t\S (5.12) 

for all t G [—1,0) and y G Mi in the rescaled flows (Mi,g~i(t), (y~i,0)), where I denotes the reduced 
length function of Perelman. The lower bound follows from Perelman's Harnack inequality [18], and 
the upper bound follows from Zhang's Theorem 12.271 originally proved in [20] . 

To clarify this point, note that in principle the required heat kernel estimates already follow from 
(15. 9|) on BdiOji, 0) and neither depend on any special structure of the Ricci flow nor on the assumed 
value of C. We used some results from [181 GO] instead just for the sake of simplicity. 

(4) As usual, let us write 

H(y h 0\y,t) = (4vr|t|)-fe- / ^^. (5.13) 
Then the two bounds in ()5. 12[) together with the local regularity f)5.9[) tell us that 

h(y,t) ^ foo(y,t) (5.14) 

smoothly on compact subsets because the heat kernels Hg. satisfy uniform derivative bounds. Now 
the entropy smallness (|5.5f) together with Proposition 12.261 shows that 

dvy^dt < y. (5.15) 



[ 2|t|(l-16|t|) / 

J~h ■> M i 

16 1 



Thus, by Fatou's lemma, the function foo constructed in (|5.14p is a soliton potential for the limiting 
Ricci flow with singular time t = 0. Hence the t-time slice of the limiting flow is isometric to the 
16|t|-rescaling of the (— ^)-time slice. But then the only way for the curvature to stay bounded as 
t —> is if (Moo, <7oo(i)) is flat for all t. This contradicts (|5.11j) and thus proves the lemma. □ 

We are now in good shape to in fact prove a strengthening of our main theorem. 
Theorem 5.16. Theorem 1 1. 16\ is true and we can even replace ([1.180 by M XQ {s) > —e. 
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Proof. Define 5 = min{l, icg^^ETJ]} an< ^ e = § £ |5T2} ^ e can assume — s = T = 1 > 5. Then 

Vx G B s (x , 0) : A4(l) > -£[52] 
by Theorem 11.401 Thus Proposition 15.21 tells us that r\R m \(xo,0) > £ [5^ > e. □ 
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